Can You Answer This? – Exploring Zero-Shot QA Generalization Capabilities in Large Language Models (Student Abstract)

نویسندگان

چکیده

The buzz around Transformer-based language models (TLM) such as BERT, RoBERTa, etc. is well-founded owing to their impressive results on an array of tasks. However, when applied areas needing specialized knowledge (closed-domain), medical, finance, performance takes drastic hits, sometimes more than older recurrent/convolutional counterparts. In this paper, we explore zero-shot capabilities large LMs for extractive QA. Our objective examine change in the face domain drift i.e. target data vastly different semantic and statistical properties from source attempt explain subsequent behavior. To end, present two studies paper while planning further experiments later down road. findings indicate flaws current generation TLM limiting closed-domain

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Can you identify this condition?

A 38-year-old man was referred to our clinic with a 2-week history of painful lesions located on the penis. He had no past medical history for this event. During the first few days of onset, he had a fever (38.5oC) without any other systemic symptoms. Treatment with ketoconazole cream and topical steroids had worsened the lesions. A physical examination revealed a number of small ulcers with a ...

متن کامل

Can you identify this condition?

A 13-year-old boy presents with asymptomatic lesions involving the mucosal aspects of his upper and lower lips. The lesions have been increasing in size and number over the past 5 months. He is not sexually active, is not taking any medications, and does not have a family history of a similar condition. Skin examination reveals multiple soft, mucosa-coloured papules, 2 to 8 mm in length, some w...

متن کامل

Can you Identify This Malignancy?

A Malignant peritoneal mesothelioma B Benign multicystic peritoneal mesothelioma C Well-differentiated papillary mesothelioma History Mr. B. is a 39-year-old male with no significant medical history who began experiencing progressive abdominal girth and bloating. Ultrasound and CT of the abdomen and pelvis revealed moderate to large abdominal ascites. Ultrasound-guided paracentesis yielded 7 L ...

متن کامل

One-Shot Generalization in Deep Generative Models

Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure, and then be able to generate compelling alternative variations of the concept. We develop machine learning systems with this important capacity by developing ...

متن کامل

When Will You Answer This? Estimating Response Time in Twitter

We present a study analyzing the response times of users to questions on Twitter. We investigate estimating these response times using an exponential distribution-based wait time model learned from users’ previous responses. Our analysis considers several different model building approaches, including personalized models for each user, general models built for all users, and time-sensitive mode...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i13.27019